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Foreword 



id , 



This Technical Specification has been produced by the 3 Generation Partnership Project (3GPP). 

The contents of the present document are subject to continuing work within the TSG and may change following formal 
TSG approval. Should the TSG modify the contents of the present document, it will be re-released by the TSG with an 
identifying change of release date and an increase in version number as follows: 

Version x.y.z 

where: 

x the first digit: 

1 presented to TSG for information; 

2 presented to TSG for approval; 

3 or greater indicates TSG approved document under change control. 

y the second digit is incremented for all changes of substance, i.e. technical enhancements, corrections, 
updates, etc. 

z the third digit is incremented when editorial only changes have been incorporated in the document. 



Introduction 



The present document specifies the full rate speech transcoding within the digital cellular telecommunications system. 

NOTE: The present document is a reproduction of recommendation T/L/03/1 1 "13 kbit/s Regular Pulse 
Excitation - Long Term Prediction - Linear Predictive Coder for use in the digital cellular 
telecommunications system". 

A 3,5 inch diskette (Annex 4 informative) is supplied with the present document, the diskette contains test sequences, as 
described in clause 5. 

The diskette contain files labelled as follows: 

Diskette 1 ETS 300 580-2, annex 4: Test sequences for the GSM Full Rate speech codec; Test sequences 

SEQ0 1 .xxx to SEQ05 .xxx. 

The specification from which the present document has been derived was originally based on CEPT documentation, 
hence the presentation of the present document may not be entirely in accordance with the ETSI/PNE Rules. 
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0.1 Scope 



The transcoding procedure specified in the present document is applicable for the full-rate traffic channel (TCH) in 
digital cellular telecommunications system. The use of this transcoding scheme for other applications has not been 
considered. 

In recommendation GSM 06.01, a reference configuration for the speech transmission chain of the digital cellular 
telecommunications system is shown. According to this reference configuration, the speech encoder takes its input as a 
13 bit uniform PCM signal either from the audio part of the mobile station or on the network side, from the PSTN via 
an 8 bit/A-law to 13 bit uniform PCM conversion. The encoded speech at the output of the speech encoder is delivered 
to a channel encoder unit which is specified in Rec.GSM 05.03. In the receive direction, the inverse operations take 
place. 

The present document describes the detailed mapping between input blocks of 160 speech samples in 13 bit uniform 
PCM format to encoded blocks of 260 bits and from encoded blocks of 260 bits to output blocks of 160 reconstructed 
speech samples. The sampling rate is 8000 sample/s leading to an average bit rate for the encoded bit stream of 13 
kbit/s. The coding scheme is the so-called Regular Pulse Excitation - Long Term prediction - Linear Predictive Coder, 
here-after referred to as RPE-LTP. 

The recommendation also specifies the conversion between A-law PCM and 13 bit uniform PCM. Performance 
requirements for the audio input and output parts are included only to the extent that they affect the transcoder 
performance. The recommendation also describes the codec down to the bit level, thus enabling the verification of 
compliance to the recommendation to a high degree of confidence by use of a set of digital test sequences. These test 
sequences are also described and are available in the diskette (informative annex 4) which is attached to the back cover 
of the present document. 



0.2 References 



The following documents contain provisions which, through reference in this text, constitute provisions of the present 
document. 

• References are either specific (identified by date of publication, edition number, version number, etc.) or 
non-specific. 

• For a specific reference, subsequent revisions do not apply. 

• For a non-specific reference, the latest version applies. In the case of a reference to a 3GPP document (including 
a GSM document), a non-specific reference implicitly refers to the latest version of that document in the same 
Release as the present document. 

[1] GSM 01.04 (ETR 100): " Digital cellular telecommunications system (Phase 2); Definitions, 

abbreviationsand acronyms". 

[2] GSM 05.03 (ETS 300 575): " Digital cellular telecommunications system (Phase 2); Channel 

coding". 

[3] GSM 06.01 (ETS 300 580-1): " Digital cellular telecommunications system (Phase 2); Full rate 

speech processingfunctions". 

[4] GSM 11.10 (ETS 300 607): " Digital cellular telecommunications system (Phase 2); Mobile 

Station (MS) conformity specification". 

[5] ETS 300 085: "Integrated Services Digital Network (ISDN);3,lkHz telephony teleservice 

Attachment requirements for handset terminals (Candidate NET 33)". 

[6] ITU-T Recommendation G.71 1 : "Pulse code modulation (PCM) of voice frequencies". 

[7] ITU-T Recommendation G.71 2: "Transmission performance characteristics of pulse code 

modulation". 
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[8] ITU-T Recommendation G.726: "40, 32, 24, 16 kbit/s adaptive differential pulse code modulation 

(ADPCM)". 

[9] ITU-T Recommendation Q.35: "Technical characteristics of tones for the telephone service". 

[10] ITU-T Recommendation V.21: "300 bits per second duplex modem standardised for use in the 

general switched telephone network". 

[11] ITU-T Recommendation V.23: "600/1 200-band modem standardised for use in the general 

switched telephone network". 



0.3 Abbreviations 

Abbreviations used in the present document are listed in GSM 01.04. 

1 Description 

1.1 void 

1.2 Outline description 

The specification is structured as follows: 

Section 1.3 contains a functional description of the audio parts including the A/D and D/A functions. Section 1.4 
describes the conversion between 13 bit uniform and 8 bit A-law samples. Sections 1.5 and 1.6 present a simplified 
description of the principles of the RPE-LTP encoding and decoding process respectively. In section 1 .7, the sequence 
and subjective importance of encoded parameters are given. 

Section 2 deals with the transmission characteristics of the audio parts that are relevant for the performance of the RPE- 
LTP codec. 

Some transmission characteristics of the RPE-LTP codec are also specified in section 2. Section 3 presents the 
functional description of the RPE-LTP coding and decoding procedures, whereas section 4 describes the computational 
details of the algorithm. Procedures for the verification of the correct functioning of the RPE-LTP are described in 
section 5. 

Performance and network aspects of the RPE-LTP codec are contained in annex 1 . 

1 .3 Functional description of audio parts 

The analogue-to-digital and digital-to-analogue conversion will in principle comprise the following elements: 
1) Analogue to uniform digital 
microphone, 

input level adjustment device, 
input anti-aliasing filter, 
sample-hold device sampling at 8 kHz, 
analogue-to-uniform digital conversion to 13 bits representation. 

The uniform format shall be represented in two's complement. 
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2) Uniform digital to analogue 

conversion from 13 bit /8kHz uniform PCM to analogue, 

a hold device, 

reconstruction filter including x/sin x correction, 

output level adjustment device, 

earphone or loudspeaker. 

In the terminal equipment, the A/D function may be achieved either 



by direct conversion to 1 3 bit uniform PCM format. 

or by conversion to 8 bit/A-law companded format, based on a standard A-law codec/filter according to 
CCITT rec. G. 71 1/714, followed by the 8-bit to 13-bit conversion according to the procedure specified in 
section 1.4. 



For the D/A operation, the inverse operations take place. 

In the latter case it should be noted that the specifications in CCITT recommendation G.714 (superseded by G.712) are 
concerned with PCM equipment located in the central parts of the network. When used in the terminal equipment, the 
present document does not on its own ensure sufficient out-of-band attenuation. 

The specification of out-of-band signals is defined in section 2 between the acoustic signal and the digital interface to 
take into account that the filtering in the terminal can be achieved both by electronic and acoustical design. 

1 .4 PCM format conversion 

The conversion between 8 bit A-law companded format and the 13-bituniform format shall be as defined in CCITT 
Recommendation G.721 (superseded by G.726), section 4.2.1, sub-block EXPAND and section 4.2.7, sub-block 
COMPRESS. The parameter LAW = 1 should be used. 

1 .5 Principles of the RPE-LTP Encoder 

A simplified block diagram of the RPE-LTP encoder is shown in Fig 1.1. In this diagram the coding and quantization 
functions are not shown explicitly. 

The input speech frame, consisting of 160 signal samples (uniform 13 bit PCM samples), is first pre-processed to 
produce an offset-free signal, which is then subjected to a first order pre-emphasis filter. The 160 samples obtained are 
then analyzed to determine the coefficients for the short term analysis filter (LPC analysis). These parameters are then 
used for the filtering of the same 160 samples. The result is 160 samples of the short term residual signal. The filter 
parameters, termed reflection coefficients, are transformed to log. area ratios, LARs, before transmission. 

For the following operations, the speech frame is divided into 4 sub-frames with 40 samples of the short term residual 
signal in each. Each sub-frame is processed blockwise by the subsequent functional elements. 

Before the processing of each sub-block of 40 short term residual samples, the parameters of the long term analysis 
filter, the LTP lag and the LTP gain, are estimated and updated in the LTP analysis block, on the basis of the current 
sub-block of the present and a stored sequence of the 120 previous reconstructed short term residual samples. 

A block of 40 long term residual signal samples is obtained by subtracting 40 estimates of the short term residual signal 
from the short term residual signal itself. The resulting block of 40 long term residual samples is fed to the Regular 
Pulse Excitation analysis which performs the basic compression function of the algorithm. 
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As a result of the RPE-analysis, the block of 40 input long term residual samples are represented by one of 4 candidate 
sub-sequences of 13 pulses each. The subsequence selected is identified by the RPE grid position (M). The 13 RPE 
pulses are encoded using Adaptive Pulse Code Modulation (APCM) with estimation of the sub-block amplitude which 
is transmitted to the decoder as side information. 

The RPE parameters are also fed to a local RPE decoding and reconstruction module which produces a block of 40 
samples of the quantized version of the long term residual signal. 

By adding these 40 quantized samples of the long term residual to the previous block of short term residual signal 
estimates, a reconstructed version of the current short term residual signal is obtained. 

The block of reconstructed short term residual signal samples is then fed to the long term analysis filter which produces 
the new block of 40 short term residual signal estimates to be used for the next sub-block thereby completing the 
feedback loop. 

1 .6 Principles of the RPE-LTP Decoder 

The simplified block diagram of the RPE-LTP decoder is shown in fig 1.2. The decoder includes the same structure as 
the feed-back loop of the encoder. In error-free transmission, the output of this stage will be the reconstructed short term 
residual samples. These samples are then applied to the short term synthesis filter followed by the de-emphasis filter 
resulting in the reconstructed speech signal samples. 

1 .7 Sequence and subjective importance of encoded 
parameters 

As indicated in fig 1.1 the three different groups of data are produced by the encoder are: 
the short term filter parameters, 
the Long Term Prediction (LTP) parameters 
- the RPE parameters. 



The encoder will produce this information in a unique sequence and format, and the decoder must receive the same 
information in the same way. In table 1.1, the sequence of output bits bl to b260 and the bit allocation for each 
parameter is shown. 

The different parameters of the encoded speech and their individual bits have unequal importance with respect to 
subjective quality. Before being submitted to the channel encoding function the bits have to be rearranged in the 
sequence of importance as given in GSM 05.03. The ranking has been determined by subjective testing and the 
procedure used is described in annex 2. 
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Table 1.1a: Encoder output parameters in order of occurrence and bit allocation within the speech 

frame of 260 bits/20 ms 
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Table: 1.1b: Encoder output parameters in order of occurrence and bit allocation within the speech 

frame of 260 bits/20 ms 
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Figure 1 .1 : Simplified block diagram of the RPE - LTP encoder 
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Figure 1.2: Simplified block diagram of the RPE - LTP decoder 
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Transmission characteristics 



This section specifies the necessary performance characteristics of the audio parts for proper functioning of the speech 
trancoder. Some transmission performance characteristics of the RPE-LTP transcoder are also given to assist the 
designer of the speech transcoder function. The information given here is redundant and the detailed specifications are 
contained in recommendation GSM 11.10. 

The performance characteristics are referred to the 13 bit uniform PCM interface. 

NOTE: To simplify the verification of the specifications, the performance limits may be referred to an A-law 
measurement interface according to CCITT Recommendation G.71 1. In this way, standard measuring 
equipment for PCM systems can be utilized for measurements. The relationship between the 13 bit format 
and the A-law companded shall follow the procedures defined in section 1 .4. 

2.1 Performance characteristics of the analogue/digital 
interfaces 

Concerning 1) discrimination against out-of-band signals (sending) and 2) spurious out-of-band signals (receiving), the 
same requirements as defined in ETSI standard TE 04-15 (digital telephone, candidate NET33) apply. 



2.2 Transcoder delay 



Consider a back to back configuration where the parameters generated by the encoder are delivered to the speech 
decoder as soon as they are available. 

The transcoder delay is defined as the time interval between the instant a speech frame of 160 samples has been 
received at the encoder input and the instant the corresponding 160 reconstructed speech samples have been out-put by 
the speech decoder at an 8 kHz sample rate. 

The theoretical minimum delay which can be achieved is 20 ms. The requirement is that the transcoder delay should be 
less than 30 ms. 



3 Functional description of the RPE-LTP Codec 

The block diagram of the RPE-LTP-coder is shown in fig 3.1. The individual blocks are described in the following 
sections. 

3.1 Functional description of the RPE-LTP Encoder 

The Preprocessing section of the RPE-LTP encoder comprises the following two sub-blocks: 

* Offset compensation (3.1.1) 

* Preemphasis (3.1.2) 

The LPC analysis section of the RPE-LTP encoder comprises the following five sub-blocks: 

* Segmentation (3.1.3) 

* Auto-Correlation (3.1.4) 
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* Schur Recursion (3.1.5) 

* Transformation of reflection coefficients to Log. -Area Ratios (3.1.6) 

* Quantization and coding of Log. -Area Ratios (3.1.7) 

The Short term analysis filtering section of the RPE-LTP comprises the following four sub-blocks: 

* Decoding of the quantized Log. -Area Ratios (LARs) (3.1.8) 

* Interpolation of Log. -Area Ratios (3.1.9) 

* Transformation of Log. -Area Ratios into reflection coefficients (3.1.10) 

* Short term analysis filtering (3.1.11) 



The Long Term Predictor (LTP) section comprises 4 sub-blocks working on subsegments (3.1.12) of the short term 
residual samples. 

* Calculation of LTP parameters (3.1.13) 



* Coding of the LTP lags (3.1.14) and the LTP gains (3.1.15) 

* Decoding of the LTP lags (3.1.14) and the LTP gains (3.1.15) 

* Long term analysis filtering (3.1.16), and Long term synthesis filtering (3.1.17) 

The RPE encoding section comprises five different sub-blocks: 

* Weighting filter (3.1.18) 

* Adaptive sample rate decimation by RPE grid selection (3.1.19) 

* APCM quantization of the selected RPE sequence (3.1.20) 

* APCM inverse quantization (3.1.21) 

* RPE grid positioning (3.1.22) 
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PREPROCESSING SECTION 

3.1.1 Offset compensation 

Prior to the speech encoder an offset compensation,by a notch filter is applied in order to remove the offset of the input 
signal s to produce the offset-free signal s f. 

s o£ (k) = s o (k) - s o (k-l) + alpha*s o£ (k-l) (3.1.1) 

alpha = 32735*2~ 15 

3.1.2 Preemphasis 

The signal s f is applied to a first order FIR preemphasis filter leading to the input signal s of the analysis section. 

s(k) = s of (k) - beta*s of (k-1) (3.1.2) 

beta= 28180*2~ 15 
LPC ANALYSIS SECTION 

3.1.3 Segmentation 

The speech signal s(k) is divided into non-overlapping frames having a length of Tq = 20 ms (160 samples). A new 
LPC-analysis of order p=8 is performed for each frame. 

3.1.4 Autocorrelation 

The first p+1 = 9 values of the Auto-Correlation function are calculated by 

159 

ACF(k)= 2-i s(i)s(i-k) ,k = 0,1. ..,8 (3.2) 

i=k 

3.1.5 Schur Recursion 

The reflection coefficients are calculated as shown in Fig 3.2 using the Schur Recursion algorithm. The term "reflection 
coefficient" comes from the theory of linear prediction of speech (LPC), where a vocal tract representation consisting of 
series of uniform cylindrical sections is assumed. Such a representation can be described by the reflection coefficents or 
the area ratios of connected sections. 

3.1 .6 Transformation of reflection coefficients to Log. -Area Ratios 

The reflection coefficients r(i), (i=l.,8), calculated by the Schur algorithm, are in the range 

-1 <= r(i) <= + 1 

Due to the favourable quantization characteristics, the reflection coefficients are converted into Log. -Area Ratios which 
are strictly defined as follows: 

1 + r (i) 
Logarea(i) = log^o ( ) (3.3) 

1 - r(i) 

Since it is the companding characteristic of this transformation that is of importance, the following segmented 
approximation is used. 

r (i) ; |r (i) | < 0.675 
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LAR(i) = sign[r (i) ] * [2 |r (i) I-0.675] ; 0.675 <= |r(i)| < 0.950 
sign[r (i) ] * [8 |r (i) I-6.375] ; 0.950 <= |r(i)| <= 1.000 

(3.4) 

with the result that instead of having to divide and obtain the logarithm of particular values, it is merely necessary to 
multiply, add and compare these values. 

The following equation (3.5) gives the inverse transformation. 

LAR' (i) ; |LAR' (i) | <0 . 675 

r' (i)=sign[LAR' (i)]*[0.500* |LAR' (i) | 

+ 0.337500] ; . 675<= | LAR ' (i) | <1 . 225 

sign [LAR' (i)]*[0.125* | LAR ' (i) | 

+0.796875] ; 1 . 225<= | LAR ' (i) | <=1 . 625 

(3.5) 

3.1 .7 Quantization and coding of Log. -Area Ratios 

The Log.-Area Ratios LAR(i) have different dynamic ranges and different asymmetric distribution densities. For this 
reason, the transformed coefficients LAR(i) are limited and quantized differently according to the following equation 
(3.6), with LAR c (i) denoting the quantized and integer coded version of LAR(i). 



LAR C ( i ) 

with 
Nint {z} 



Nint {A(i) *LAR(i) + B(i)} 



int{z+sign{z}*0.5} 



(3.6) 



(3.6a) 



Function Nint defines the rounding to the nearest integer value, with the coefficients A(i), B(i), and different extreme 
values of LAR c (i) for each coefficient LAR(i) given in table 3.1. 

Table 3.1 : Quantization of the Log.-Area Ratios LAR(i) 



LAR No i 


A(i) 


B(i) 


Minimum 
LAR c (i) 


Maximum 
LAR c (i) 


1 


20.000 


0.000 


-32 


+31 


2 


20.000 


0.000 


-32 


+31 


3 


20.000 


4.000 


-16 


+15 


4 


20.000 


-5.000 


-16 


+15 


5 


13.637 


0.184 


-8 


+ 7 


6 


15.000 


-3.500 


-8 


+ 7 


7 


8.334 


-0.666 


-4 


+ 3 


8 


8.824 


-2.235 


-4 


+ 3 



SHORT-TERM ANALYSIS FILTERING SECTION 

The current frame of the speech signal s is retained in memory until calculation of the LPC parameters LAR(i) is 
completed. The frame is then read out and fed to the short term analysis filter of order p=8. However, prior to the 
analysis filtering operation, the filter coefficients are decoded and preprocessed by interpolation. 

3.1 .8 Decoding of the quantized Log.-Area Ratios 

In this block the quantized and coded Log.-Area Ratios (LAR c (i)) are decoded according to equation (3.7). 
LAR ' ' ( i ) = ( LAR C (i)-B(i))/A(i) (3.7) 
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3.1 .9 Interpolation of Log. -Area Ratios 

To avoid spurious transients which may occur if the filter coefficients are changed abruptly, two subsequent sets of 
Log. -Area Ratios are interpolated linearly. Within each frame of 160 analysed speech samples the short term analysis 
filter and the short term synthesis filter operate with four different sets of coefficients derived according to table 3.2. 

Table 3.2: Interpolation of LAR parameters (J=actual segment) 



k 


LAR'j(i) = 


0... 


12 


0.75*LAR"J-1(i) + 0.25*LAR"J(i) 


13.. 


26 


0.50*LAR"J-1(i) + 0.50*LAR"J(i) 


27.. 


39 


0.25*LAR"J-1(i) + 0.75*LAR"J(i) 


40.. 


....159 


LAR"J(i) 



3.1 .1 Transformation of Log. -Area Ratios into reflection coefficients 

The reflection coefficients are finally determined using the inverse transformation according to equation (3.5). 



3.1 .1 1 Short Term Analysis Filtering 

The Short term analysis filter is implemented according to the lattice structure depicted in fig 3.3. 

d (k) = s (k) (3.8a) 

u (k) = s (k) (3.8b) 

d-i_(k) = d i _ 1 (k) + r' i *u i _ 1 (k-1) with i=l,...8 (3.8c) 

u ± (k) = u i _ 1 (k-l) + r' i *d i _ 1 (k) with i=l,...8 (3.8d) 

d(k ) = d 8 (k) (3.8e) 
LONG-TERM PREDICTOR (LTP) SECTION 



3.1.12 Sub-segmentation 

Each input frame of the short term residual signal contains 160 samples, corresponding to 20 ms. The long term 
correlation is evaluated four times per frame, for each 5 ms subsegment. For convenience in the following, we note 
j=0,...,3 the sub-segment number, so that the samples pertaining to the j-th sub-segment of the residual signal are now 
denoted by d(k;+k) with j = 0,...,3; k; = kg + j*40 and k = 0,...,39 where kg corresponds to the first value of the current 

frame. 

3.1 .13 Calculation of the LTP parameters 

For each of the four sub-segments a long term correlation lag N;, (j=0,...,3), and an associated gain factor b;, (j=0,...,3) 
are determined. For each sub-segment, the determination of these parameters is implemented in three steps. 

1) The first step is the evaluation of the cross-correlation R;(lambda) of the current sub-segment of short term 
residual signal d(k;+i),(i=0,...,39) and the previous samples of the reconstructed short term residual signal 
d'(kj+i), (i=-120,...,-l): 



R-; (lambda) 



39 



i=0 



J 



0, . . .3 



7 i d(kj+i) *d' (kj+i-lambda) ; kj 



k 
lambda = 



+■ j*40 
40, . . ., 120 

(3.9) 
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The cross-correlation is evaluated for lags lambda greater than or equal to 40 and less than or equal to 120, ie 
corresponding to samples outside the current sub-segment and not delayed by more than two sub-segments. 

2) The second step is to find the position N; of the peak of the cross-correlation function within this interval: 



Rj(Nj) = max { Rj (lambda) ; lambda = 40.. 120 }; 
j = 0, . . .,3 



3) The third step is the evaluation of the gain factor b; according to: 



(3.10) 



Rj (Nj) / Sj (Nj) ; 



j = 0, . . .,3 



(3.11) 



with 



39 



Sj (Nj) 



Z_, d' : 



i=0 



(kj+i-Nj) ; j = 0, . . .,3 



(3.12) 



It is clear that the last 120 samples of the reconstructed short term residual signal d'(k;+i),(i=-120,...,-l) must be retained 
until the next sub-segment so as to allow the evaluation of the relations (3.9), ...,(3. 12). 

3.1 .14 Coding/Decoding of the LTP lags 

The long term correlation lags N;,(j=0,...,3) can have values in the range (40,. ..,120), and so must be coded using 7 bits 
with: 



en 



Nj; 



0, ...,3 



(3.13) 



At the receiving end, assuming an error free transmission, the decoding of these values will restore the actual lags: 



"C]' 



0, ...,3 



(3.14) 



3.1 .15 Coding/Decoding of the LTP gains 

The long term prediction gains b;,(j=0,...,3) are encoded with 2 bits each, according to the following algorithm: 



if 



bj <= DLB(i) then b c j 



if DLB(i-l) < bj <= DLB(i) then b c j 



if DLB(i-l) < b 



J 



then b 



cj 



= 0; i=0 
i; i=l,2 

3; i=3 



(3.15) 



where DLB(i),(i=0,...,2) denotes the decision levels of the quantizer, and b c ; represents the coded gain value. Decision 
levels and quantizing levels are given in table 3.3. 

Table 3.3: Quantization table for the LTP gain 



i 


Decision level 
DLB(i) 


Quantizing level 
QLB(i) 





0.2 


0.10 


1 


0.5 


0.35 


2 


0.8 


0.65 


3 




1.00 
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The decoding rule is implemented according to: 



b- 



QLB (b c . 



0, 



,3 



(3.16) 



where QLB(i),(i=0,...,3) denotes the quantizing levels, and b;' represents the decoded gain value (see table 3.3). 



3.1.16 Long term analysis filtering 

The short term residual signal d(kQ+k),(k=0,..., 159) is processed by sub-segments of 40 samples. From each of the four 
sub-segments (j=0,...,3) of short term residual samples, denoted here d(k;+k), (k=0,...,39), an estimate d"(k;+k), 
(k=0,...,39) of the signal is subtracted to give the long term residual signal e(k;+k), (k=0,...,39) (see fig 3.1): 



e(kj+k) = d(kj+k) - d" (kj+k) 



j = 0, . . .,3 
k = 0, . . .,39 

kj = k + j*40 



(3.17) 



Prior to this subtraction, the estimated samples d"(k;+k) are computed from the previously reconstructed short term 
residual samples d', adjusted to the current sub-segment LTP lag N;' and weighted with the sub-segment LTP gain b;' 



d" (kj+k) = bj'*d' (kj+k-Nj') 



J 

j = 0, . . .,3 
k = 0, . . .,39 
kj = k + j*40 



J 



(3.18) 



3.1 .1 7 Long term synthesis filtering 

The reconstructed long term residual signal e'(kQ+k),(k=0,...,159) is processed by sub-segments of 40 samples. To each 
sub-segment, denoted here e'(k;+k), (k=0,...,39), the estimate d"(k;+k), (k=0,...,39) of the signal is added to give the 
reconstructed short term residual signal d'(k;+k),(k=0,...,39): 



d' (kj+k) = e' (kj+k) + d" (kj+k) ; k = 0,...,39 (3.19) 



j = 0, . . .,3 
; k = 0, . . .,39 



k 



j = k + j*40 



RPE ENCODING SECTION 

3.1.18 Weighting Filter 

A FIR 'block filter' algorithm is applied to each sub-segment by convolving 40 samples e(k) with the impulse response 
H(i) ; i=0,...,10 (see table 3.4). 

Table 3.4: Impulse response of block filter (weighting filter) 



i 


5 


4(6) 


3(7) 


2(8) 


1(9) 


0(10) 


H(\)*2 V:> 


8192 


5741 


2054 





-374 


-134 



|H(Omega=0)| = 2.779; 

The conventional convolution of a sequence having 40 samples with an 1 1-tap impulse response would produce 
40+1 1-1=50 samples. In contrast to this, the 'block filter' algorithm produces the 40 central samples of the conventional 
convolution operation. For notational convenience the block filtered version of each sub-segment is denoted by x(k), 
k=0,...,39. 
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10 

x(k) = 2-i H(i) * e(k+5-i) with k = 0,...,39 (3.20) 

i=0 

NOTE : e(k+5-i) = for k+5-i<0 and k+5-i>39. 

3.1 .19 Adaptive sample rate decimation by RPE grid selection 

For the next step, the filtered signal x is down-sampled by a ratio of 3 resulting in 3 interleaved sequences of lengths 14, 
13 and 13, which are split up again into 4 sub-sequences x m of length 13: 

x m (i) = x(kj+m+3*i) ; i = 0,...,12 (3.21) 

m = 0, . . . , 3 

with m denoting the position of the decimation grid. According to the explicit solution of the RPE mean squared error 
criterion, the optimum candidate sub-sequence xjy[ is selected which is the one with the maximum energy 

12 



2 x m 2 d) 



E M = max ^j x m z (i) ; m = , . . . , 3 (3.22) 

m i=0 

The optimum grid position M is coded as M c with 2 bits. 

3.1 .20 APCM quantization of the selected RPE sequence 

The selected sub-sequence Xty[(i) (RPE sequence) is quantized, applying APCM (Adaptive Pulse Code Modulation). 
For each RPE sequence consisting of a set of 13 samples xjy[(i) ,the maximum x max of the absolute values lxjy[(i)l is 
selected and quantized logarithmically with 6 bits as x maxc as given in table 3.5. 
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Table 3.5: Quantization of the block maximum x max 



Xmax 


X max 


Xmaxc 




















x 


nax 


X max 


Xmaxc 


0.. 31 


31 







2048. 


2303 


2303 


32 


32 .. 63 


63 


1 




2304. 


2559 


2559 


33 


64.. 95 


95 


2 




2560. 


2815 


2815 


34 


96.. 127 


127 


3 




2816. 


3071 


3071 


35 


128.. 159 


159 


4 




3072. 


3327 


3327 


36 


160.. 191 


191 


5 




3328. 


3583 


3583 


37 


192 ..223 


223 


6 




3584. 


3839 


3839 


38 


224 .. 255 


255 


7 




3840. 


4095 


4095 


39 


256 .. 287 


287 


8 




4096. 


4607 


4607 


40 


288 ..319 


319 


9 




4608. 


5119 


5119 


41 


320 .. 351 


351 


10 




5120. 


5631 


5631 


42 


352 .. 383 


383 


11 




5632. 


6143 


6143 


43 


384 ..415 


415 


12 




6144. 


6655 


6655 


44 


41 6. .447 


447 


13 




6656. 


7167 


7167 


45 


448 .. 479 


479 


14 




7168. 


7679 


7679 


46 


480 ..511 


511 


15 




7680. 


8191 


8191 


47 


51 2. .575 


575 


16 




8192. 


9215 


9215 


48 


576 .. 639 


639 


17 




9216.. 


10239 


10239 


49 


640 .. 703 


703 


18 




10240. 


11263 


11263 


50 


704 .. 767 


767 


19 




11264. 


12287 


12287 


51 


768 .. 831 


831 


20 




12288. 


13311 


13311 


52 


832 .. 895 


895 


21 




13312. 


14335 


14335 


53 


896 .. 959 


959 


22 




14336. 


15359 


15359 


54 


960.. 1023 


1023 


23 




15360. 


16383 


16383 


55 


1024.. 1151 


1151 


24 




16384. 


18431 


18431 


56 


1152.. 1279 


1279 


25 




18432. 


20479 


20479 


57 


1280.. 1407 


1407 


26 




20480 . 


22527 


22527 


58 


1408.. 1535 


1535 


27 




22528 . 


24575 


24575 


59 


1536.. 1663 


1663 


28 




24576 . 


26623 


26623 


60 


1664.. 1791 


1791 


29 




26624 . 


28671 


28671 


61 


1792.. 1919 


1919 


30 




28672 . 


30719 


30719 


62 


1920.. 2047 


2047 


31 




30720 . 


32767 


32767 


63 



For the normalization, the 13 samples are divided by the decoded version x' max of the block maximum. Finally, the 
normalized samples 

x'(i) = x M<i)/ x 'max > i=0,...,12 (3.23) 

are quantized uniformly with three bits to xjy[ c (i) as given in table 3.6. 

Table 3.6: Quantization of the normalized RPE-samples 



x'*2 15 


Xm'*2 13 


Xmc 


(Interval-limits) 




(Channel) 


-32768 ... -24577 


-28672 


= 000 


-24576... -16385 


-20480 


1 =001 


-16384... -8193 


-12288 


2 = 010 


-8192... -1 


-4096 


3 = 011 


0... 8191 


4096 


4 = 100 


8192... 16383 


12288 


5 = 101 


16384... 24575 


20480 


6 = 110 


24576... 32767 


28672 


7 = 111 



3.1 .21 APCM inverse quantization 

The xjy[ c (i) are decoded to x^j'(i) an d denormalized using the decoded value x' maxc leading to the decoded sub- 
sequence x']y[(i). 
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3.1 .22 RPE grid positioning 



The quantized sub-sequence is upsampled by a ratio of 3 by inserting zero values according to the grid position given 
with Mc. 

3.2 Decoder 

The decoder comprises the following 4 sections. Most of the sub-blocks are also needed in the encoder and have been 
described already. Only the short term tynthesis filter and the deemphasis filter are added in the decoder as new sub- 
blocks. 

* RPE decoding section (3.2.1) 

* Long Term Prediction section (3.2.2) 

* Short term synthesis filtering section (3.2.3) 

* Postprocessing (3.2.4) 



The complete block diagram for the decoder is shown in fig 3.4. The variables and parameters of the decoder are 
marked by the index r to distinguish the received values from the encoder values. 



3.2.1 RPE decoding section 

The input signal of the long term synthesis filter (reconstruction of the long term residual signal) is formed by decoding 
and denormalizing the RPE-samples (APCM inverse quantization - 3.1.21) and by placing them in the correct time 
position (RPE grid positioning - 3.1.22). At this stage, the sampling frequency is increased by a factor of 3 by inserting 
the appropriate number of intermediate zero-valued samples. 

3.2.2 Long Term Prediction section 

The the reconstructed long term residual signal e r ' is applied to the long term synthesis filter (see 3.1.16 and 3.1.17) 
which produces the reconstructed short term residual signal d r ' for the short term synthesiser. 

3.2.3 Short term synthesis filtering section 

The coefficients of the short term synthesis filter (see fig 3.5) are reconstructed applying the identical procedure to that 
in the encoder (3.1.8 - 3.1.10). The short term synthesis filter is implemented according to the lattice structure depicted 
in fig 3.5. 

s r(0) (k) = d r ' (k) (3.24a) 

s r (i)(k) = s r ( i _ 1 )(k) - r r ' (9-i) * v 8 _ i (k-l); i=l,...,8 

(3.24b) 
vg_-i_(k) = V3_-i_(k-l) + r r '(9_ i ) * s r (-j_)(k); i=l,...,8 

(3.24c) 
s r * (k) = s r(8) (k) (3.24d) 

v (k) = s r (g) (k) (3 .24e) 

3.2.4 Postprocessing 

The output of the synthesis filter s r (k) is fed into the IIR- deemphasis filter leading to the output signal s r „. 
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n = 1 



ACF = ? 



yes 



no 



K(9-i) = ACF(i);i=1, 
P(j) = ACF(j); j=0, 



,7 
-8 



P(0)<|P(1)|? 
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Figure 3.2: LPC analysis using Schur recursion 
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4 Computational details of the RPE-LTP Codec 

4.1 Data representation and arithmetic operations 

Only two types of variables are used along the implementation of the RPE-LTP algorithm in fixed point arithmetic. 
These two types are: 

Integer on 16 bits; 

Long integer on 32 bits; 



This assumption simplifies the detailed description and allows the maximum reach of precision. 

In different places of the recommendation, different scaling factors are used according to different operations. To help 
the reader in the comparison of corresponding floating point and fixed point values given in section 3 and 4 comments 
of the format: 

/* var = integer ( real_var * scalefactor ) */ 

are used at several points of section 4. var is the rounded fixed point representation of the floating point representation 
of var (real_var) using the given scaling factor. 

In the description, input signal samples, coded parameters and output signal samples are represented by 16 bit words. At 
the receiving part it must therefore be ensured that only valid bits (13 bits for samples signal and two to seven bits for 
coded parameters) are used. In verification tests, the testing system may introduce random bit at non valid places inside 
these samples (3 LSBs) or parameters (MSBs) to test this function. In the digital test sequences all non valid bits are set 
to 0. 

The following part of this section describes the required set of arithmetic operations to implement the RPE-LTP 
algorithm in fixed point. 

For arithmetics operations or variables with a long integer type (32 bit) a prefix L_ is used in order to distinguish them 
from the 16 bit variables or arithmetic operations. 
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All the names of the variables are identical to those of the functional description of the RPE-LTP Codec (Section 3) but 
variables like x', x" are respectively called: 

x ' > xp 

x ' ' > xpp 



in order to avoid any confusing notation. 

NOTE : The x',x" variables are examples but are not used within the following description. 

The following notations are used in the arithmetic operations: 

Square brackets ( [..] ) are used for arrays and when needed, the starting index and the ending index are put inside the 
bracket. For example x[0..159] means that x is an array of 160 words of 16 bits with beginning index and ending 
index 159 and x[k] is an element of the array x[0..159]. 

All functions' names are underlined. For example add ( x, y) means that we perform the addition of x and y. 

« n : denotes a n-bit arithmetic shift left operation (zero fill) on variables of type short or long; if n is 
less than 0, this operation becomes an arithmetic right shift of -n. 

» n : denotes a n-bit arithmetic right shift operation (sign extension ) on variables of type short or long; if n is 

less than 0, this operation becomes an arithmetic left shift of -n (zero fill). 

a > b : denotes the "greater than" condition; 

a >= b : denotes the "greater than or equal" condition; 

a < b : denotes the "less than" condition; 

a <= b : denotes the "less than or equal" condition; 

a == b : denotes the "equal to" condition. 

The basic structure of the FOR-NEXT loop is used in this description for loop computation; the declaration is: 

I == FOR k= start to end: 

inner computation; 
|== NEXT k: 

Also the IF.. ELSE IF structure is used throughout this detailed description. The basic structure is: 

IF (conditionl) THEN statementl; 

ELSE IF ( condition2) THEN statement2; 

ELSE IF ( condition3) THEN statement3; 

The word EXIT is used to exit immediately from a procedure. 

The following arithmetic operations are defined: 

add ( varl, var2) : performs the addition (varl+var2) with overflow control and saturation; the result is set at 

+32767 when overflow occurs or at -32768 when underflow occurs. 

sub( varl, var2) : performs the subtraction (varl-var2) with overflow control and saturation; the result is set 

at +32767 when overflow occurs or at -32768 when underflow occurs. 

mult ( varl, var2) : performs the multiplication of varl by var2 and gives a 16 bits result which is scaled ie 

mult (varl,var2 ) = (varl times var2) » 15 and mult ( -32768, -32768 ) = 32767 

mult r ( varl, var2) : same as mult but with rounding ie mult r ( varl, var2 ) = ( (varl times var2) + 16384 ) » 

15 and mult r ( -32768, -32768 ) = 32767 

abs ( varl ) : absolute value of varl; abs(-32768) = 32767 
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div( varl, var2) : div produces a result which is the fractional integer division of varl by var2; varl and var2 

must be positive and var2 must be greater or equal to varl; The result is positive (leading 
bit equal to 0) and truncated to 16 bits, if varl == var2 then div ( varl, var2 ) = 32767 

L_mult(varl, var2) : L mult is a 32 bit result for the multiplication of varl times var2 with a one bit shift left. 

L mult ( varl, var2 ) = ( varl times var2 ) « 1, The condition L mult (-32768, -32768 ) 
does not occur in the algorithm. 

L add (L varl, L_var2) : 32 bits addition of two 32 bits variables (L_varl + L_var2) with overflow control and 

saturation; the result is set at 2147483647 when overflow occurs and at -2147483648 when 
underflow occurs. 

L_sub(L_varl,L_var2) : 32 bits subtraction of two 32 bits variables (L_varl - L_var2) with overflow control and 

saturation; the result is set at 2147483647 when overflow occurs and at -2147483648 when 
underflow occurs. 

norm( L_varl ) : norm produces the number of left shifts needed to normalize the 32 bits variable L_varl for 

positive values on the interval with minimum of 1073741824 and maximum of 
2147483647 and for negative values on the interval with minimum of -2147483648 and 
maximum of -1073741824; in order to normalize the result, the following operation must 
be done: L_norm_varl = L_varl « norm (L varl) 

L_var2 = varl; : deposit the 16 bits of varl in the LSB 16 bits of L_var2 with sign extension. 

var2 = L_varl; : extract the 16 LSB bits of L_varl to put in var2. 

When a constant is used in an operation on 32 bits, it must be first sign-extended on 32 bits. 

4.2 Fixed point implementation of the RPE-LTP Coder 

The RPE-LTP coder works on a frame by frame basis. The length of the frame is equal to 160 samples. Some 
computations are done once per frame (analysis) and some others for each of the four sub-segments (40 samples). 

In the following detailed description, procedure 4.2.0 to 4.2.10 are done once per frame to produce at the output of the 
coder the LARc[1..8] parameters which are the coded LAR coefficients and also to realize the inverse filtering 
operation for the entire frame (160 samples of signal d[0..159]). These parts produce at the output of the coder: 

I LARc[1..8] : Coded LAR coefficients 

I — > These parameters are calculated and sent once per frame. 

Procedure 4.2.1 1 to 4.2.18 are to be executed four times per frame. That means once for each sub-segment RPE-LTP 
analysis of 40 samples. These parts produce at the output of the coder: 

Nc : LTP lag; 
be : Coded LTP gain; 
Mc : RPE grid selection; 

xmaxc : Coded maximum amplitude of the RPE sequence; 

xMc[0..12] : Codes of the normalized RPE samples; 
--> These parameters are calculated and sent four times per frame. 

PREPROCESSING SECTION 



4.2.0 Scaling of the input variable 



After A-law to linear conversion (or directly from the A to D converter) the following scaling is assumed for input to 
the RPE-LTP algorithm: 

S . v. v. v. v. v. v. v. v. v. v. v. v. x . x . x ( 2's complement format). 

Where S is the sign bit, v a valid bit, and x a "don't care" bit. 

The original signal is called sop [ . . ] ; 



4.2.1 Downscaling of the input signal 



|== FOR k=0 to 159: 

I so[k] = sop[k] » 3; 
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so[k] = so[k] << 2; 
|== NEXT k: 

4.2.2 Offset compensation 

This part implements a high-pass filter and requires extended arithmetic precision for the recursive part of this filter. 
The input of this procedure is the array so[0..159] and the output the array sof[0..159]. 

== FOR k = to 159: 
Compute the non-recursive part. 

si = sub ( so[k], zl ); 

zl = so [k] ; 
Compute the recursive part . 

L_s 2 = si; 

L_s2 = L_s2 << 15; 

Execution of a 31 by 16 bits multiplication. 

msp = L_z2 >> 15; 

lsp = L_sub ( L_z2, ( msp << 15 ) ) ; 

temp = mult_r ( lsp, 32735 ); 

L_s2 = L_add ( L_s2, temp ); 

L_z2 = L_add ( L_mult ( msp, 32735 ) >> 1, L_s2 ); 
Compute sof[k] with rounding . 

sof[k] = L_add ( L_z2, 16384 ) >> 15; 
== NEXT k: 

Keep zl and L z2 in memory for the next frame . 
Initial value: zl=0; L z2=0; 



4.2.3 Preemphasis 



|== FOR k=0 to 159: 

I s[k] = add( sof[k], mult_r ( mp, -28180 ) ); 

I mp = s o f [ k ] ; 

|== NEXT k: 

Keep mp in memory for the next frame . 

Initial value : mp=0; 

LPC ANALYSIS SECTION 

4.2.4 Autocorrelation 

The goal is to compute the array L_ACF[k]. The signal s[i] must be scaled in order to avoid an overflow situation. 
Dynamic scaling of the array s[0..1591 . 
Search for the maximum . 

smax = ; 

|== FOR k = to 159: 

temp = abs ( s [k] ) ; 
I IF ( temp > smax ) THEN smax = temp; 
|== NEXT k; 

Computation of the scaling factor . 

IF ( smax == ) THEN scalauto = 0; 

ELSE scalauto = sub ( 4, norm( smax << 16 ) ); 
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Scaling of the array s[0..159l . 

IF ( scalauto > ) THEN 

I temp = 16384 >> sub ( scalauto, 1). 
|== FOR k = to 159: 

s [ k ] = mult_r ( s [ k ] , t emp ) ; 
|== NEXT k: 

Compute the L ACF[..l . 

= FOR k=0 to 8: 

L_ACF [ k ] = ; 
=== FOR i=k to 159: 

L_temp = L_mult ( s[i], s[i-k] ); 

L_ACF[k] = L_add ( L_ACF [k] , L_temp ); 
=== NEXT i: 
= NEXT k: 

Rescaling of the array s[0..1591 . 

IF ( scalauto > ) THEN 

|== FOR k = to 159: 

I s [k] = s [k] << scalauto; 

|== NEXT k: 



4.2.5 Computation of the reflection coefficients 

Schur recursion with 16 bits arithmetic . 

IF ( L_ACF[0] == ) THEN 

|== FOR i = 1 to 8: 

I r[i] = 0; 

|== NEXT i: 

I EXIT; /continue with section 4.2.6/ 
temp = norm( L_ACF[0] ); 
|== FOR k=0 to 8: 

I ACF[k] = ( L_ACF[k] << temp ) >> 16; 

|== NEXT k: 

Initialize array P[..l and K[..l for the recursion . 

= FOR i=l to 7: 

K[9-i] = ACF[i] ; 
= NEXT i: 
= FOR i=0 to 8: 

P[i] = ACF[i] ; 
= NEXT i: 

Compute reflection coefficients . 



FOR n=l to 8: 
IF ( P[0] < abs ( P [1] ) ) THEN 



== FOR i = n to 8: 

r[i] = 0; 
== NEXT i: 

EXIT; /continue with 
section 4.2.6/ 



r[n] = div( abs ( P[l] ), P[0] ); 

IF ( P[l] > ) THEN r[n] = sub ( 0, r[n] ); 



IF ( n 



) THEN EXIT; /continue with 
section 4.2-6/ 



Schur recursion. 



P[0] = add( P[0], mult_r ( P[l], r[n] ) ); 
== FOR m=l to 8-n: 

P [m] = add ( P[m+1], mult_r ( K[9-m], r[n] ) ); 

K[9-m] = add ( K[9-m], mult_r ( P[m+1], r[n] ) ) 
== NEXT m: 

NEXT n: 
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NOTE : The following lines gives one correct implementation of the div(num, denum) arithmetic operation. 
Compute div which is the integer division of num by denum: with denum >= num > 0. 



L_num = num; 
L_denum = denum; 
div =0; 

= FOR k = to 14: 
div= div << 1; 
L_num = L_num < < 1 ; 
IF ( L_num >= L_denum) THEN 

I L_num=L_sub (L_num, L_denum) ; 
div = add ( div ,1 ) ; 
= NEXT k: 



4.2.6 Transformation of reflection coefficients to Log. -Area Ratios 



The following scaling for r[..] and LAR[..] has been used: 



integer ( real_r [ . . ] *32768 . ); -1. <= real_r <1 . */ 



/* r[. .] = 

/* 

/ * LAR [ . . ] 

/* 

/* with -1.625 <= real_LAR <= 1.625 



integer ( real_LAR[ . . ] *16384 . ) 



Computation of the LART1..81 from the r[l.. 81 . 



FOR i = 1 to 8: 
temp = abs ( r[i] ); 

IF ( temp < 22118 ) THEN temp = temp >> 1; 
ELSE IF ( temp < 31130 ) THEN 
temp= sub (temp, 11059); 
ELSE temp = sub ( temp, 26112 ) << 2; 
LAR[i] = temp; 

IF ( r[i] < ) THEN LAR[i] = sub ( 0, LAR[i] ); 
NEXT i: 



4.2.7 Quantization and coding of the Log. -Area Ratios 

This procedure needs four tables; the following equations give the optimum scaling for the constants: 



/* A[l. .8] 

/* 

/* B[l. .8] 

/* 

/ * MAC [ 1 . . 

/* 

/* MIC[1. . 



= integer( real_A [ 1 . . 8 ] *1024 ) 
= integer( realj [ 1 . . 8 ] *512 ) ; 
8]= maximum of the LARc [ 1 . . 8 ] 
8]= minimum of the LARc [ 1 . . 8 ] 



values (see table4.1)*/ 
*/ 

values (see table4.1)*/ 
*/ 

values (see table4.1)*/ 
*/ 

values (see table4.1)*/ 



Computation for quantizing and coding the LAR[1..81 . 



FOR i =1 to 8: 
temp= mult ( A[i], LAR[i] ); 
temp= add ( temp, B[i] ); 

temp= add ( temp, 256); for rounding 
LARc[i]= temp >> 9; 

Check IF LARc[i] lies between MIN and MAX 

IF ( LARc[i] > MAC[i] ) THEN LARc[i] = MAC[i]; 
IF ( LARc[i] < MIC[i] ) THEN LARc[i] = MIC[i]; 
LARc[i] = sub( LARc[i], MIC[i] ); /See note below/ 
NEXT i: 



NOTE : The equation is used to make all the LARc[i] positive. 
SHORT TERM ANALYSIS FILTERING SECTION 

4.2.8 Decoding of the coded Log. -Area Ratios 

This procedure requires for efficient implementation two tables. 

/* INVA[1. . 8]=integer ( (32768*8) / (real_A[l. . 8] ) ; *, 
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/* 8 values (table 4.2 ) */ 

/* MIC[1. . 8]=minimum value of the LARc [ 1 . . 8 ] ; */ 

/* 8 values (table 4.1) */ 

Compute the LARpp [1.. 81 . 

FOR i=l to 8: 

tempi = add( LARc[i], MIC[i] ) << 10; /See note below/ 

temp2 = B[i] << 1; 

tempi = sub ( tempi, temp2); 

tempi = mult_r ( INVA[i], tempi); 

LARpp[i] = add ( tempi, tempi); 
NEXT i: 

NOTE : The addition of MIC[i] is used to restore the sign of LARc[i] . 

4.2.9 Computation of the quantized reflection coefficients 

Within each frame of 160 analysed speech samples the short term analysis and synthesis filters operate with four 
different sets of coefficients, derived from the previous set of decoded LARs(LARpp(j-l)) and the actual set of decoded 
LARs (LARppCJ)). 

4.2.9.1 Interpolation of the LARpp[1 ..8] to get the LARp[1 ..8] 

For k_start = to k„end = 12. 
|==== FOR i= 1 to 8: 
I LARp[i] = add ( (LARpp ( j-1) [i] >> 2 ) , (LARpp ( j ) [i] » 2)); 

LARp[i] = add( LARp[i] , ( LARpp ( j-1 ) [i] » 1 ) ); 
|==== NEXT i: 

For k_start = 13 to k_end = 26. 
|==== FOR i= 1 to 8: 

I LARp[i] = add ( (LARpp (j-1) [1] » 1 ), (LARpp ( j ) [i] >> 1 )); 
|==== NEXT i: 

For k_start = 27 to k_end = 39. 
==== FOR i= 1 to 8: 

LARp[i] = add ( (LARpp (j-1) [1] » 2 ) , (LARpp ( j ) [i] » 2 )); 
I LARp[i] = add( LARp[i] , ( LARpp (j)[i] » 1 ) ); 
|==== NEXT i: 

For k_start = 40 to k_end = 159. 
|==== FOR i= 1 to 8: 
I LARp[i] = LARpp (j) [1]; 
==== NEXT i: 

Initial value : LARpp(j-l)[1..8]=0; 

4.2.9.2 Computation of the rp[1 ..8] from the interpolated LARp[1 ..8] 

The input of this procedure is the interpolated LARp[1..8] array. The reflection coefficients, rp[i], are used in the 
analysis filter and in the synthesis filter. 

= FOR i=l to 8: 

temp = abs ( LARp[i] ); 

IF ( temp < 11059 ) THEN temp = temp << 1; 
ELSE IF (temp < 20070) THEN 

temp = add(temp, 11059); 
ELSE temp = add ( (temp >> 2), 26112 ); 
r p [ i ] = t emp ; 

IF ( LARp[i] < ) THEN rp [ i ] = sub ( 0, rp[i] ); 
= NEXT i: 

4.2.1 Short term analysis filtering 

This procedure computes the short term residual signal d[..] to be fed to the RPE-LTP loop from the s[..] signal and 
from the local rp[..] array (quantized reflection coefficients). As the call of this procedure can be done in many ways 
(see the interpolation of the LAR coefficient), it is assumed that the computation begins with index k_start (for arrays 
d[..] and s[..]) and stops with index k_end (k_start and k_end are defined in 4.2.9.1). This procedure also needs to keep 
the array u[0..7] in memory for each call. 
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= FOR k = k_start to k_end: 
di = s [k] 
sav = di; 
=== FOR i = 1 to 8: 

temp = add ( u[i-l], mult_r ( rp[i], di ) ); 
di = add( di, mult_r ( rp[i], u[i-l] ) ); 
u[i-l] = sav; 
sav = temp; 
=== NEXT i: 

d[k] = di; 
= NEXT k: 

Keep the array u[0..71 in memory . 

Initial value : u[0..7]=0; 

LONG TERM PREDICTOR (LTP) SECTION 

4.2.1 1 Calculation of the LTP parameters 

This procedure computes the LTP gain (be) and the LTP lag (Nc) for the long term analysis filter. This is done by 
calculating a maximum of the cross-correlation function between the current sub-segment short term residual signal 
d[0..39] (output of the short term analysis filter; for simplification the index of this array begins at and ends at 39 for 
each sub-segment of the RPE-LTP analysis) and the previous reconstructed short term residual signal dp[-120..-l]. A 
dynamic scaling must be performed to avoid overflow. 

Search of the optimum scaling of d[0..391 . 

dmax = ; 

|== FOR k = to 39: 

I temp = abs ( d[k] ); 

I IF ( temp > dmax ) THEN dmax = temp; 

|== NEXT k: 

temp = 0; 

IF ( dmax == ) THEN seal = 0; 

ELSE temp = norm{ dmax << 16 ) ; 
IF ( temp > 6 ) THEN seal = 0; 

ELSE seal = sub ( 6, temp ); 

Initialisation of a working array wt[0..391 . 

== FOR k = to 39: 
I wt [k] = d[k] » seal; 
|== NEXT k: 

Search for the maximum cross-correlation and coding of the LTP lag . 

L_max = ; 

Nc = 40; (index for the maximum cross-correlation) 
= FOR lambda = 40 to 120: 

L_result = 0; 
=== FOR k = to 39: 

L_temp = L_mult ( wt [k] , dp[k-lambda] ); 
L_result = L_add ( L_temp, L_result ); 
=== NEXT k: 

IF ( L_result > L„max) THEN 

I Nc = lambda; 
I L_max = L_result ; 
= NEXT lambda: 

Rescaling of L max . 

L_max = L_max >> ( sub ( 6, seal ) ) ; 

Initialisation of a working array wt[0..391 . 

== FOR k = to 39: 
I wt [k] = dp[k-Nc] » 3; 

|== NEXT k: 

Compute the power of the reconstructed short term residual signal dpf.-l . 
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Ljower = 0; 

|== FOR k =0 to 39: 

I L_temp = L mult ( wt [k] , wt [k] ); 

I L_power = L add ( L_temp, L_power ); 

|== NEXT k: 

Normalization of L max and L power . 

IF ( L_max <= ) THEN 

I be = 0; 

I EXIT; /cont. with 4.2.12/ 
IF ( L_max >= L_power ) THEN 

I be = 3; 

I EXIT; /cont. with 4.2.12/ 
temp = norm ( L_power ) ; 
R = ( L_max << temp ) >> 16; 
S = ( L_power << temp ) >> 16; 

Coding of the LTP gain . 

Table 4.3a must be used to obtain the level DLB[i] for the quantization of the LTP gain b to get the coded version be. 

|== FOR be = to 2: 

I IF (R <= mult (S, DLB[bc])) THEN EXIT; /cont. with 

4.2.12/ 
|== NEXT be; 
be = 3; 

Initial value : dp[-120..-l]=0; 



4.2.1 2 Long term analysis filtering 



In this part, we have to decode the be parameter to compute the samples of the estimate dpp[0..39]. The decoding of be 
needs the use of table 4.3b. The long term residual signal e[0..39] is then calculated to be fed to the RPE encoding 
section. 

Decoding of the coded LTP gain . 

bp = QLB[bc] ; 

Calculating the array e[0..391 and the array dpp[0..391 . 

== FOR k = to 39: 

I dpp[k] = mult r ( bp, dp[k-Nc] ); 

I e[k] = sub( d[k] , dpp[k] ) ; 

|== NEXT k: 

RPE ENCODING SECTION 

4.2.13 Weighting filter 

The coefficients of the weighting filter are stored in a table (see table 4.4). The following scaling is used: 

/* H[0..10] = integer( real_H [0 . . 10 ] *8192 ); */ 

Initialisation of a temporary working array wt[0..491 . 

= FOR k= to 4: 

wt [k] = 0; 
= NEXT k: 

= FOR k = 5 to 44: 
wt [k] = e[k-5] ; 
= NEXT k: 

= FOR k= 45 to 49: 

wt [ k ] = ; 
= NEXT k: 

Compute the signal x[0..391 . 

== FOR k= to 39: 
I L_result = 8192; /rounding of the output 

of the filter/ 
|==== FOR i = to 10: 
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L_temp = L mult ( wt[k+i], H[i] ); 
L_result = L add ( L_result, L_temp ); 
== NEXT i: 

L_result = L add (L result, L result); /scaling (x2)/ 
L_result = L add (L result, L result); /scaling (x4)/ 
x[k] = L_result >> 16; 
NEXT k: 



4.2.14 RPE grid selection 



The signal x[0..39] is used to select the RPE grid which is represented by Mc. 



EM =0; 
Mc = 0; 



FOR m = to 3: 

L_result = 0; 
== FOR i = to 12: 

tempi = x[m+(3*i)] >> 2; 
L_t emp = L mult ( t emp 1 , t emp 1 ) ; 
L_result = L add ( L_temp, L_result ); 
== NEXT i: 

IF ( L_result > EM) THEN 

I Mc = m; 
I EM = L_result; 
NEXT m: 



Down-sampling by a factor 3 to get the selected xM[0..121 RPE sequence . 



|== FOR i = to 12: 

xM[i] = x[Mc + (3*i) ] ; 
|== NEXT i: 



4.2.15 APCM quantization of the selected RPE sequence 



Find the maximum absolute value xmax of xM[0..121 . 



xmax = 0; 

|== FOR i = to 12: 

I temp = abs ( xM[i] ) ; 

I IF ( temp > xmax ) THEN xmax 

|== NEXT i: 



temp; 



Quantizing and coding of xmax to get xmaxc. 



exp = 0; 

temp = xmax >> 9; 

itest = 0; 

= FOR i = to 5: 

IF ( temp <= ) THEN itest 

t emp = t emp > > 1 ; 

IF ( itest == ) THEN exp = 
= NEXT i: 



= i; 
add ( exp, 1 ) 



temp = add ( exp, 5 ) ; 

xmaxc = add ( ( xmax >> temp ) , ( exp << 3 ) ) ; 

Quantizing and coding of the xM[0..121 RPE sequence to get the xMc[0..121 . 

This computation uses the fact that the decoded version of xmaxc can be calculated by using the exponent and the 
mantissa part of xmaxc (logarithmic table). 

So, this method avoids any division and uses only a scaling of the RPE samples by a function of the exponent. A direct 
multiplication by the inverse of the mantissa (NRFAC[0..7] found in table 4.5) gives the 3 bit coded version xMc[0..12] 
of the RPE samples. 
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Compute exponent and mantissa of the decoded version of xmaxc . 

exp = ; 

IF ( xmaxc > 15 ) THEN exp = sub ( ( xmaxc >> 3 ) , 1 ) ; 

mant = sub ( xmaxc , ( exp << 3 ) ) ; 

Normalize mantissa <= mant <= 7 . 

IF ( mant == ) THEN | exp = -4; 

I mant = 15; 
ELSE | itest = 0; 

= FOR i = to 2: 

IF ( mant > 7 ) THEN itest = 1; 

IF (itest == 0) THEN mant = add ( (mant << 1),1); 
IF ( itest == ) THEN exp = sub ( exp, 1 ); 
= NEXT i: 
mant = sub ( mant, 8 ) ; 

Direct computation of xMc[0..121 using table 4.5 . 

templ= sub ( 6, exp ); /normalization by the exponent/ 
temp2 = NRFAC[mant]; /see table 4.5 (inverse mantissa)/ 

FOR i = to 12: 

t emp = xM [ i ] < < t emp 1 ; 

temp = mult ( temp , temp2 ) ; 

xMc[i] = add( ( temp » 12 ) , 4 ) ; /See note below/ 
NEXT i: 

NOTE : This equation is used to make all the xMc[i] positive. 
Keep in memory exp and mant for the following inverse APCM quantizer . 

4.2.16 APCM inverse quantization 

This part is for decoding the RPE sequence of coded xMc[0..12] samples to obtain the xMp[0..12] array. Table 4.6 is 
used to get the mantissa of xmaxc (FAC[0..7]). 

tempi = FAC[mant]; see 4.2.15 for mant 
temp2= sub ( 6, exp ); see 4.2.15 for exp 
temp3= 1 << sub ( temp2, 1 ); 

FOR i =0 to 12: 

temp = sub ( ( xMc[i] << 1 ) , 7 ) ; /See note below/ 

temp = temp << 12; 

temp = mult_r ( tempi, temp ); 

temp = add ( temp, temp3 ); 

xMp[i] = temp >> temp2; 
NEXT i; 

NOTE : This subtraction is used to restore the sign of xMc[i]. 

4.2.17 RPE grid positioning 

This procedure computes the reconstructed long term residual signal ep[0..39] for the LTP analysis filter. The inputs are 
the Mc which is the grid position selection and the xMp[0..12] decoded RPE samples which are upsampled by a factor 
of 3 by inserting zero values. 

|== FOR k = to 39: 
I ep[k] = 0; 

|== NEXT k: 

|== FOR i = to 12: 

ep[Mc + (3*i) ] = xMp[i] ; 
|== NEXT i: 

4.2.18 Update of the reconstructed short term residual signal dp[-120..-1] 

This procedure adds the reconstructed long term residual signal ep[0..39] to the estimated signal dpp[0..39] from the 
long term analysis filter to compute the recontsructed short term residual signal dp[-40..-l]; also the reconstructed short 
term residual array dp [-120.. -41] is updated. 
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|== FOR k = to 79: 

dp[-120+k] = dp[-80+k]; 
|== NEXT k: 

== FOR k = to 39: 

I dp[-40+k] = add( ep[k], dpp[k] ); 

|== NEXT k: 

Keep the array dp[-120..-ll in memory for the next sub-segment . 
Initial value : dp[-120..-l]=0; 

4.3 Fixed point implementation of the RPE-LTP Decoder 

Only the synthesis filter and the de-emphasis procedure are different from the procedures found in the RPE-LTP coder. 
Procedures 4.3.1 and 4.3.2 are executed for each sub-segment (four times per frame). Procedures 4.3.3, 4.3.4 and 4.3.5 
are executed once per frame. 

4.3.1 RPE decoding section 

Procedures 4.2. 15 (only the part to get mant and exp of xmaxc), 4.2. 16 and 4.2. 17 are used to obtain the reconstructed 
long term residual signal erp[0..39] signal from the received parameters for each sub-segment (ie Mcr, xmaxcr, 
xmcr[0..12]). 

4.3.2 Long term synthesis filtering 

This procedure uses the bcr and Ncr parameter to realize the long term synthesis filtering. The decoding of bcr needs the 
use of table 4.3b. 

- Nr is the received and decoded LTP lag. 

- An array drp[-120..39] is used in this procedure. 



The elements for -120 to -1 of the array drp are kept in memory for the long term synthesis filter. For each sub-segment 
(40 samples), this procedure computes the drp[0..39] to be fed to the synthesis filter. 

Check the limits of Nr . 

Nr = Ncr; 

IF ( Ncr < 40 ) THEN Nr = nrp; 

IF ( Ncr > 120 ) THEN Nr = nrp; 

nrp= Nr; 

Keep the nrp value for the next sub-segment . 
Initial value : nrp=40; 
Decoding of the LTP gain bcr . 

brp = QLB[bcr] 

Computation of the reconstructed short term residual signal drp[0..391 . 

|== FOR k = to 39: 

drpp = mult r ( brp, drp[k-Nr] ); 
drp[k] = add ( erp[k], drpp ); 

|== NEXT k: 

Update of the reconstructed short term residual signal drp[-l..-1201 . 

|== FOR k = to 119: 

I drp[-120+k] = drp[-80+k]; 

|== NEXT k: 

Keep the array drp[-120..-ll for the next sub-segment . 
Initial value : drp[-120..-l]=0; 
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4.3.3 Computation of the decoded reflection coefficients 

This procedure (which is executed once per frame) is the same as the one described in the CODER part. For decoding 
of the received LARcr[1..8], see procedure 4.2.8. For the interpolation of the decoded Log. -Area Ratios, see procedure 
4.2.9.1 and for the computation of the reflection coefficients np[1..8], see procedure 4.2.9.2. 

4.3.4 Short term synthesis filtering section 

This procedure uses the drp[0..39] signal and produces the sr[0..159] signal which is the output of the short term 
synthesis filter. For ease of explanation, a temporary array wt[0..159] is used. 

Initialisation of the array wt[0..159 1. 

For the first sub-segment in a frame: 

|== FOR k = to 39: 
I wt [k] = drp[k] ; 

|== NEXT k: 

For the second sub -segment in a frame: 

== FOR k = to 39: 
I wt [40+k] = drp[k] ; 
|== NEXT k: 

For the third sub-segment in a frame: 

|== FOR k = to 39: 

I wt [80+k] = drp[k] ; 

|== NEXT k: 

For the fourth sub-segment in a frame: 

|== FOR k = to 39: 

I wt [120+k] = drp[k] ; 

|== NEXT k: 

As the call of the short term synthesis filter procedure can be done in many ways (see the interpolation of the LAR 
coefficient), it is assumed that the computation begins with index k_start (for arrays wt[..] and sr[..]) and stops with 
index k_end (k_start and k_end are defined in 4.2.9.1). The procedure also needs to keep the array v[0..8] in memory 
between calls. 

= FOR k = k_start to k_end: 

sri = wt [k] ; 
=== FOR i = 1 to 8: 

sri = sub ( sri, mult r ( rrp[9-i], v[8-i] ) ); 
v[9-i] = add( v[8-i], mult r ( rrp[9-i], sri ) ); 
=== NEXT i: 

sr [k] = sri; 
v [0] = sri; 
= NEXT k: 

Keep the array v[0..81 in memory for the next call . 
Initial value : v[0..8]=0; 
POSTPROCESSING 



4.3.5 Deemphasis filtering 



FOR k = to 159: 

temp = add ( sr[k], mult_r ( msr, 2 8 1 £ 

msr = temp; 

sro [k] = msr; 
NEXT k: 



Keep msr in memory for the next frame . 
Initial value : msr=0; 
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4.3.6 Upscaling of the output signal 

|== FOR k = to 159: 

srop[k] = add ( sro[k], sro[k] ); 
|== NEXT k: 

4.3.7 Truncation of the output variable 

|== FOR k = to 159: 

srop[k] = srop[k] >> 3; 
I srop[k] = srop[k] << 3; 
|== NEXT k: 

The output format is the following: 

S . v . v . v . v . v . v . v . v . v . v . v . v . . . (2's complement). 

Where S is the sign bit, v a valid bit. 

NOTE : When a linear to A-law compression is needed, then the sub-block COMPRESS of CCITT G721 
recommendation must be used with inputs: 

SR = srop[k] >> 3; 
LAW = 1; 

4.4 Tables used in the fixed point implementation of the rpe-ltp 
coder and decoder 

Table 4.1 : Quantization of the Log. -Area Ratios 



i 


A[l] 


B[i] 


MIC[i] 


MAC[i] 


1 


20480 





-32 


31 


2 


20480 





-32 


31 


3 


20480 


2048 


-16 


15 


4 


20480 


-2560 


-16 


15 


5 


13964 


94 


-8 


7 


6 


15360 


-1792 


-8 


7 


7 


8534 


-341 


-4 


3 


8 


9036 


-1144 


-4 


3 



Table 4.2: Tabulation of 1/A[1..8] 



i 


INVA[i] 


1 


13107 


2 


13107 


3 


13107 


4 


13107 


5 


19223 


6 


17476 


7 


31454 


8 


29708 
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Table 4.3a: Decision level of the LTP gain quantizer 



be 


DLB[bc] 





6554 


1 


16384 


2 


26214 


3 


32767 



Table 4.3b: Quantization levels of the LTP gain quantizer 



be 


QLB[bc] 





3277 


1 


11469 


2 


21299 


3 


32767 



Table 4.4: Coefficients of the weighting filter 



i 


H[i] 





-134 


1 


-374 


2 





3 


2054 


4 


5741 


5 


8192 


6 


5741 


7 


2054 


8 





9 


-374 


10 


-134 



Table 4.5: Normalized inverse mantissa used to compute xM/xmax 



i 


NRFAC[i] 





29128 


1 


26215 


2 


23832 


3 


21846 


4 


20165 


5 


18725 


6 


17476 


7 


16384 



Table 4.6: Normalized direct mantissa used to compute xM/xmax 



i 


FAC[i] 





18431 


1 


20479 


2 


22527 


3 


24575 


4 


26623 


5 


28671 


6 


30719 


7 


32767 
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Digital test sequences 



This chapter provides information on the digital test sequences that have been designed to help in the verification of 
implementations of the RPE-LTP codec. Copies of these sequences are available (see annex 3). 

5.1 Input and output signals 

Table 5.1 defines the input and output signals for the test sequences. The words defined in this table use 16 bits. The left 
or right justification is indicated in the table. The codewords described in the table correspond to one frame of coder 
input or decoder output signal; i.e for 20 ms of input signal the 76 codewords are obtained at the output of the coder and 
76 codewords provided at the input of the decoder will yield 20 ms of output signal in the decoder. 

5.2 Configuration for the application of the test sequences 

Two configurations are appropriate in order to test an implementation of the RPE-LTP coder. The first one is for testing 
the coder part of the RPE-LTP; it means that a sop[..] signal is provided at the input of the encoder that furnishes frames 
of coded parameters. This output has to be checked against some reference file. The other configuration is for testing 
the decoder part of the RPE-LTP; in this case frames of coded parameters (see table 5.1) are sent to the RPE-LTP 
decoder that furnishes the srop[..] signal. These samples have to be checked against a reference file. 



5.2.1 Configuration 1 (encoder only) 



A reset signal (RS) must be applied to the RPE-LTP encoder under test to set all internal variables to the exact states 
specified in section 4 of the present document prior to the start of an input test sequence in order to obtain the correct 
output values for this test. This test must be done in real time with a sampling rate of 8 kHz at the input of the encoder 
under test (see figure 5.1). All the necessary hardware and software should be installed by the user in order to capture in 
real time the output coded parameters of the RPE-LTP encoder and to compare them to the dedicated reference file. 

| 8 kHz sampling rate 
v 



RS > | RPE-LTP j Coded parameters 

I Encoder j / > Comparison 

Input: /-> I under test j 2-7 bits 

13 bits 

Figure 5.1 : Configuration 1 : RPE-LTP encoder under test 



5.2.2 Configuration 2 (Decoder only) 



Figure 5.2 shows a RPE-LTP decoder under test. In the same way as described in the coder part, a reset signal (RS) 
must be used before the processing of the first frame of coded parameters. The decoder must be tested for a continuous 
output with a sampling rate of 8 kHz. At the input of the decoder, the 76 parameters must be sent in a time interval of 20 

ms. 

| 8 kHz sampling rate 
v 



RS >| RPE-LTP i Output signal: srop[k] 

Decoder j / > Comparison 

Input: coded / — >| under test ! 13 bits 

parameters 

2-7 bits 

Figure 5.2: Configuration 2: RPE-LTP decoder under test 
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5.3 Test sequences 

5.3.1 Test sequences for configuration 1 

For configuration 1, four types of input test sequence are provided: 

1 . Sequence for testing the overflow controls in the encoder 

2. Sequence for testing the LPC part of the encoder 

3. Sequence for testing the LTP part of the encoder 

4. Sequence for testing various critical parts of the algorithm 



Sequence 1 uses a large number of saturated samples. The residual LPC signal reaches very high values, which has two 
effects on the processing: 

occurrence of a large number of overflows in addition/subtraction operations. Table 5.2 describes each overflow 
point and the number of occurrences for each. 



the excitation RPE samples have a large dynamic range and the 64 codewords of the sub-block maximum are 
each obtained at least once on output. 



Sequence 2 focuses successively on each reflection coefficient calculated in the Schur recursion. Table 5.3 shows which 
frames deal with which reflection coefficient and its dynamic range. The Log. -Area codewords output by the coder 
cover the full range of their possible values except the 2nd LARc that does not reach the value and 63 (min and max). 
The maximum value (63) is however obtained in sequence 4. 

Sequence 3 tests the long term predictor part of the algorithm. It has been generated by exciting a sharply resonant filter 
with a periodic train of impulses; this produces a pitched signal. Each part corresponding to a given pitch is 128 ms (4 
blocks of 256 words) long. The pitch periods have been randomly drawn in the range [2,15] ms and the random order is 
shown in table 5.4. 

Sequence 4 accounts for various remaining non tested points of the algorithm where implementing errors may be 
suspected. Table 5.5 and 5.6 summarize the critical points that this sequence has been designed to check (ie where the 
three previous sequences were ineffective). Table 5.5 shows the list of tested points where errors can be detected. Each 
tested error is described and the frame number corresponding to the first occurrence of a divergence between the exact 
and the degraded algorithm is also indicated. 

Table 5.6 illustrates three paths of the algorithm that are never explored during the processing of the three previous 
sequences; the table shows which condition leads to each path and the number of associated occurrences in sequence 4. 

Notice finally one point where special care must be taken : 

A small degradation (ie +/- 1) of DLB[2] (the third decision level of the LTP gain quantizer (see table 4.3a) is 
unable to provide any noticeable effect on the output of the four sequences described above). 



5.3.2 Test sequences for configuration 2 

Five types of input test sequence are provided for this configuration. Four sequences obtained in configuration 1 at the 
output of the encoder (coded parameters) are used as input for the decoder under test in configuration 2. 

Table 5.7 gives the list of tested overflow points and their occurrence on sequence 1 for this configuration. 



ETSI 



3GPP TS 06.10 version 4.3.0 GSM Phase 2 43 ETSI TS 100 580-2 V4.3.0 (2001-06) 

Sequence 5 is provided to scan all possible codes for each parameter. This sequence is an artificial sequence and does 
not correspond to any encoder output. The codewords have been randomly generated and cover the entire range of 
codewords values. Moreover, the delay value Nr belonging to [40,120] in an error-free transmission condition, takes in 
this sequence its value in [0,127]. In this case the decoder behaviour on non-allowed values of Nr will be tested. 
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Table 5.1a: Signals used in digital test sequences 

Name | Description | Justification 

+ 

ENCODER INPUT 

+ + 

sop[k] | 13 bits: encoder input signal. I left 

+ + 

OUTPUT PARAMETERS 



LARc [ 1 ] 


6 


bits 


1st 


Log. 


-Area 


Ratio 


LARc [2] 


6 


bits 


2nd 


Log. 


-Area 


Ratio 


LARc [ 3 ] 


5 


bits 


3rd 


Log. 


-Area 


Ratio 


LARc [ 4 ] 


5 


bits 


4th 


Log. 


-Area 


Ratio 


LARc [ 5 ] 


4 


bits 


5th 


Log. 


-Area 


Ratio 


LARc [ 6 ] 


4 


bits 


6th 


Log. 


-Area 


Ratio 


LARc [ 7 ] 


3 


bits 


7th 


Log. 


-Area 


Ratio 


LARc [ 8 ] 


3 


bits 


8th 


Log. 


-Area 


Ratio 



right 
right 
right 
right 
right 
right 
right 
right 



Sub-frame no 1 



Nc 




7 


bits 


LTP lag 


be 




2 


bits 


LTP gain 


Mc 




2 


bits 


RPE grid position 


xmaxc 




6 


bits 


Block amplitude 


xMc [ . 


.12] 


3 


bits 


RPE pulses index to 12 



right 
right 
right 
right 
right 



Nc 




7 


bits 


be 




2 


bits 


Mc 




2 


bits 


xmaxc 




6 


bits 


xMc [0. 


.12] 


3 


bits 



Sub-frame no 2 



LTP lag 

LTP gain 

RPE grid position 

Block amplitude 

RPE pulses index to 12 



right 
right 
right 
right 
right 



Sub-frame no 3 



Nc 




7 


bits 


LTP lag 


be 




2 


bits 


LTP gain 


Mc 




2 


bits 


RPE grid position 


xmaxc 




6 


bits 


Block amplitude 


xMc [ . 


.12] 


3 


bits 


RPE pulses index to 12 



right 
right 
right 
right 
right 



Sub-frame no 4 



Nc 




7 


bits 


LTP lag 


right 


be 




2 


bits 


LTP gain 


right 


Mc 




2 


bits 


RPE grid position 


right 


xmaxc 




6 


bits 


Block amplitude 


right 


xMc [0. 


.12] 


3 


bits 


RPE pulses index to 12 


right 



Table 5.1b: Signals used in digital test sequences 



DECODER OUTPUT 

+ 

srop[k] | 13 bits: decoder output signal. The 3 
I LSB's of the 16 bits are equal to 0. 



- + 

left 
I 
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Table 5.2: List of tested overflow points for sequence 1 (coder part) 



Overflow point 



Short term analysis filter (4.2.10) 
1st add 
2nd add 



No of occurrences 



1059 
134 



LTP parameters computation (4.2.11) 
Abs ( d[k] ) 

Long term analysis filter (4.2.12) 
sub 



APCM quantizer (4.2.15) 

Find max abs of xm: Abs 



11 



Weighting filter (4.2.13) 

scaling the result (both x2 and x4) 302 



49 



Update of Array dp of the long term analysis filter 

(4.2.18) 
add 



Table 5.3 



126 



Reflection 


Coeff. 


Frames 


Dynamic range 








1 




1-135 


-32564,32558 


2 




136-311 


-32356,32242 


3 




316-423 


-32157,32744 


4 




424-524 


-31594,31960 


5 




525-633 


-31697,31735 


6 




634-738 


-30055,31575 


7 




739-839 


-29090,31386 


8 




840-944 


-31052,31208 



Table 5.3 gives the position of the frames dedicated to the study of each reflection coefficient and dynamic range of the 
coefficient for sequence 2 in configuration 2. 

Table 5.4: Pitch periods of sequence 3 (configuration 1) 



86 56 68 120 52 93 20 66 82 115 114 60 42 45 17 64 16 

88 83 63 90 73 23 77 100 33 29 106 35 67 57 103 116 30 

71 69 81 47 32 97 65 62 111 49 109 25 96 50 54 91 85 

99 70 76 46 26 34 104 108 107 22 119 48 58 37 72 110 27 

24 36 87 51 59 38 21 44 113 39 61 53 IS 

112 75 98 118 41 80 31 74 28 84 



37 72 110 

40 94 105 55 



79 43 101 95 IS 



76 



117 92 102 
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Table 5.5: Errors specially detected by sequence 4/Config 1 



Test point 



-+- 



Autocorrelation function 
(4.2.4) 

Computation of the reflect, 
coefficients (4.2.5) 

Quantization and coding 
of the LARs (4.2.7) 



Comput . of the rp from the 
interp. LARp (4.2.9) 

Calc. of the LTP parameters 
->Search of the opt scaling 

(4.2.11) 
->Coding of the LTP gain 

(4.2.11) 



ADPCM inverse quantizer 
(4.2.16) 



Error checked : 
incorrect / correct 
statement / statement 



k=0 to 15i 



I k=0 to 15? 



if ( P [0] <= / if ( P [0] < 
abs(P[l]) ) / abs(P[l]) ) 



A[4] + 1 




/ 


A[4] 


21 


A[5] - 1 




/ 


A[5] 


35 


A[5] + 1 




/ 


A[5] 


430 


A[6] - 1 




/ 


A[6] 


427 


A[8] - 1 




/ 


A[8] 


8 


MAC [ 2 ] - 


1 


/ 


MAC [ 2 ] 


24 


MAC [ 2 ] + 


1 


/ 


MAC [ 2 ] 


516 


11058 




/ 


11059 


19 


20069 




/ 


20070 


25 


1 

k= to 38 


/ 


k= to 39 


1 
32 


mult r 


/ 


mult 


373 


DLB[0] + 


1 


/ 


DLB[0] 


511 


DLB[1] + 


1 


/ 


DLB[1] 


373 


FAC[2] + 


1 


/ 


FAC[2] 


422 


FAC[3] - 


1 


/ 


FAC[3] 


179 


FAC[4] + 


1 


/ 


FAC[4] 


74 


FAC[5] - 


1 


/ 


FAC[5] 


439 


FAC[5] + 


1 


/ 


FAC[5] 


74 


FAC[6] - 


1 


/ 


FAC[6] 


479 


FAC[6] + 


1 


/ 


FAC[6] 


330 


FAC[7] - 


1 


/ 


FAC[7] 


139 



No of the 
1st frame 
with error 



27 



514 



Table 5.6: Paths specially explored by sequence 4/Config 1 



Test point 



Autocorrelation function (4.2.4) 
condition smax == 

Computation of the reflection coefficients 
-> condition L_ACF[0] ==0 (4.2.5) 
-> condition P[0] < abs(P[l]) (4.2.5) 



Number of 
occurrences 



Table 5.7: List of tested overflows points for sequence 1 (decoder part) 



Overflow Point 



Nb of occurrences 
Sequence 1 Sequence 2 Sequence 3 



Long term synthesis filter 
(4.3.2) : add 



126 



Short term synthesis filter 
(4.3.4) : 

1st add: 

2nd add: 



4499 
405 



De-emphasize filter 
(4.3.5) : add 



Scaling of the output signal 
(4.3.6) : add 16691 



339 



19 
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Annex 1 (informative): Codec Performance 
A1.1 Introduction 

The purpose of this annex is to give a broad outline of the performance of the RPE-LTP codec with other parts of the 
digital network. Some general guidance is also offered on non-voice services. 



A1 .2 Speech performance 



Planning rules for digital processes are defined in terms of quantizing distortion units (qdu) which can be realized from 
the following formula (reference 1) using the assumption that the formula accuracy represents the determination of qdus 
from QN measurements: 

QN = 37 - 15 logio(n) ,where n is the qdu (Al.l) 

By definition 1 qdu is the quantization distortion arising from one commercial PCM codec. 

NOTE : The subjective testing methodology to determine QN for the RPE-LTP codec was consistent with current 
CCITT methods (reference 2). 



A1.2.1 Single encoding 



Under error-free transmission conditions the perceived quality of the RPE-LTP codec (see fig Al.l) is lower than both 
codecs conforming to recommendations CCITT G.71 1 and CCITT G.721 (superseded by G.726). Table Al.l indicates 
the relative performance of the codec and can be compared with codecs conforming to recommendations CCITT G.711 
and CCITT G.721 (superseded by G.726). 

The performance of the RPE-LTP codec has been found to be substantially unaffected down to a carrier to interference 
(C/I) ratio of 10 dB, but may be considered to have acceptable performance down to 7 dB. Smaller C/I ratios produce 
unacceptable degradation of speech performance and should be avoided. 

NOTE 1 : It should be noted that there are doubts as to whether the simulations which generated the error pattern 

properly represent real operating conditions. The C/I values quoted should therefore only be considered as 
parameters of this simulation. They may not correspond to real radio interference conditions. Results 
from early GSM validation hardware show that the C/I values which give the performance quoted may be 
several dBs higher. Some error statistics of the simulations are shown in table A1.2. 

NOTE 2 : The real condition C/I = 10 dB is believed to correspond to about 90 % coverage. 

Table A1 .1 : Relative levels of speech performance under error-free conditions 



Codec QN (dB) qdu 

G.711 37 1 

(64 kbit/s, A-law PCM) 

G.721 (superseded by G.726) 
(32 kbit/s, ADPCM) 29 3.5 (*) 

RPE-LTP 23-25 7-8 (*) 

(*) Commercial A-law PCM input and output circuitry included. 

NOTE : The qdu value for the RPE-LTP codec is a conservative estimate. At present there are no specific CCITT 
rules for determining qdus for encoding below 32 kbit/s. 
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Table A1 .2: Bit error statistics for C/l test conditions 



Simulated C/I ratio: 10 dB 7 dB 4 dB 

Total number of errors 

in class I (182 bits 

protected by a 1/2 rate 0.016% 0.61% 4.1% 

code) 

Total number of errors 

in class II (78 bits 4.5% 8.3% 13.0% 

unprotected) 

Number of "frame erasure" 

indications by CRC 1 15 95 

Number of "frame erasures" 

not detected by CRC 1 14 7 6 

NOTE : The total number of frames was 750. CRC means Cyclic Redundancy Check. 

A1 .2.2 Speech performance when interconnected with coding systems on 
an analogue basis 

A1 .2.2.1 Performance with 32 kbit/s ADPCM (G.721 , superseded by G.726) 

The speech performance of the RPE-LTP codec when interconnected with encoding at 32 kbit/s (see fig A1.3 and A1.4) 
decreases in accordance with the formula in section A 1.2, and appears to obey the law of additivity when qdus have 
been determined for the individual codecs. 

A1 .2.2.2 Performance with another RPE-LTP codec 

The speech performance of the RPE-LTP codec when interconnected with another codec of the same type (see fig A1.2) 
is lower than that of Al. 2.2.1. It again appears to obey the law of additivity when qdus have been determined for the 
individual codecs. 

A1 .2.2.3 Performance with encoding other than RPE-LTP and 32 kbit/s ADPCM 
(G.721 , superseded by G.726) 

No information is available on this point, so great care must be exercised when interconnection is made to codecs with 
encoding different from that of Al. 2. 2.1 and Al. 2.2.2. 



A1 .3 Non-speech performance 



It should be noted that the RPE-LTP speech codec is an adaptive system which has been optimised for speech inputs. 
Great care must be taken when making measurements with non-speech signals beause the normal assumptions of time 
invariance and linearity cannot be made. 



A1 .3.1 Performance with single sine waves 



Detailed experiments have shown that the RPE-LTP codec will pass sine waves with segmental signal to noise ratios 
generally in excess of 20 dB in the frequency range of 100 - 2000 Hz. However, in some cases reproduction above 2000 
Hz is not as good. 

It should be noted that sine waves above 1300 Hz may be reproduced with significant fluctuations in amplitude and 
frequency due to the adaptive sub-sampling technique employed. This results in irregularities in the measured frequency 
response. 

A typical frequency response measured with A-law PCM input circuitry is shown in fig A 1.5. If 13 bit linear PCM input 
circuitry is used, the irregularity is less. 
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A1 .3.2 Performance with DTMF tones 

It has been shown that the RPE-LTP codec transfers DTMF signals of 80 ms duration. However, questions like 
minimum allowable signal duration, pause duration and the behaviour in the presence of transmission errors have not 
been investigated. 

A1 .3.3 Performance with information tones 

Experiments have shown that network originated signalling tones, conforming to recommendation CCITT Q.35, are 
easily recognizable when passed through the RPE-LTP codec. 

A1 .3.4 Performance with voice-band data 

Tests have shown that voice-band data transmission does not work satisfactorily with 1200 bit/s modems according to 
recommendation CCITT V.23. Voice-band data according to recommendation CCITT V.21 (300 bit/s) will not be 
subject to any significant degradation. 

This behaviour has been tested for one RPE-LTP link (encoder-decoder). The effect of transmission errors has not been 
tested. 



A1.4 Delay 



The theoretical minimum delay of the RPE-LTP codec is 20 ms. However, practical realizations may have an additional 
processing time in the order of 3 - 8 ms. 



A/D 



A-iaw Codec 

^ encoder w 7 tp 



_^_ A-law 
^* decoder 



D/A 



Figure A1 .1 : One - transcoding scheme 
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Figure A1 .2: Two - transcodings scheme 
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Figure A1.3: Mixed transcodings - scheme 1 
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Figure A1.4: Mixed transcodings - scheme 2 
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Gain (dB) 




Frequency ( Hz) 



Figure A1.5: Frequency response for RPE-LTP codec (with commercial A-law PCM input and output 

circuitry) 

A1 .5 References 

1. CCITT: "Subjective performance assessment of digital processes using the Modulated Noise Reference Unit 
(MNRU)", Annex C, Supplement no 14, Red book, volume V, 1985. 

2. CCITT: "Subjective performance assessment of digital processes using the Modulated Noise Reference Unit 
(MNRU)", Annex A, Supplement no 14, Red book, volume V, 1985. 
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Annex 2 (informative): Subjective relevance of the speech 
coder output bits 

Since no valid objective quality criterion for speech signals is available, the only way to build up such a relevance table 
is to perform listening tests. The procedure described below was used to obtain the relevance classification given in 
table A2. 1 of the recommendation. 

To classify a single bit, say bit i of parameter k, a short speech signal (2 sec) was encoded, then this bit was inverted in 
each frame (the other bits were left unchanged) and the resulting bit stream was fed into the speech decoder. The 
listeners had to compare the quality of the signal with the quality of six reference signals with different levels of 
distortion. Repeating this procedure for all bits would result in a subdivision of the 260 bits into six relevance classes. It 
can be observed that many of the bits have the same physical meaning and it can be expected that bits with the same 
meaning have the same relevance (eg the MSB's of the RPE samples). Relying on this assumption, only one of the 
equivalent parameters was considered. Since there are 13 parameters with different physical meaning with 56 bits in 
total, the number of tests is reduced from 260 to 56. 

The reference signals were the same speech signal distorted by inverting one of the six bits of LAR coefficient number 
one. This resulted in an adequate quantization of distortion levels ranging from "not intelligible" (MSB inverted) to 
"negligible distortion" (LSB inverted). 

The test was carried out using three listeners and one female speaker. Since the three listeners came to rather similar 
results, no more listeners were considered to be required. Averaging the three outcomes led to the relevance table given 
in table A2.1, where the order of all bits between two successive bits of the first parameter (LAR 1) are arbitrarily 
chosen. 

Table A2.1a: Subjective importance of encoded bits (the parameter and bit numbers refer to table 1.1) 



Importance 
class 



Parameter 
name 



Parameter 

number 



Bit number 



1 


Log. area 


ratio 


1 


1 






b6 




Block amplitude 


12,29, 46, 


63 


b53, 


,bl09,bl65,b221 




Log. area 


ratio 


1 


1 






b5 


2 


Log. area 


ratio 


2 


2 






bl2 




Log. area 


ratio 


3 


3 






bl7 




Log. area 


ratio 


1 


1 






M 




Log. area 


ratio 


2 


2 






bll 




Log. area 


ratio 


3 


3 






bl6 




Log. area 


ratio 


4 


4 






b22 




LTP lag 






9,26, 43, 


60 


b43, 


,b99,bl55,b211 


3 


Block amplitude 


12,29, 46 


i, 63 


b52, 


,bl08,bl64,b220 




Log. area 


ratio 


2, 5, 6 


2, 5, 6 






bl0,b26,b30 




LTP lag 






9,26, 43, 


60 


b42, 


,b98,bl54,b210 




LTP lag 






9,26, 43, 


60 


Ml, 


,b97,bl53,b209 




LTP lag 






9,26, 43, 


60 


b40, 


,b96,bl52,b208 




LTP lag 






9,26, 43, 


60 


b39, 


,b95,bl51,b207 




Block amplitude 


12,29, i( 


i, 63 


b51, 


,bl07,bl63,b219 




Log. area 


ratio 


1 


1 






b3 




Log. area 


ratio 


4 


4 






b21 




Log. area 


ratio 


7 


7 






b33 


4 


LTP lag 






9,26, 43, 


60 


b38, 


,b94,bl50,b206 




Log. area 


ratio 


5, 6 


5, 6 






b25,b29 




LTP gain 






10,27, H 


i, 61 


b45, 


,bl01,bl57,b213 




LTP lag 






9,26, 43, 


60 


b37, 


,b93,bl49,b205 



Grid position 



11,28,45,62 b47,bl03,bl59,b215 
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Table A2.1b: Subjective importance of encoded bits (the parameter and bit numbers refer to table 1.1) 



Importance 


Parameter 






Parameter 




Bit number 


class 


name 






number 








Log. area ratio 


1 




1 




b2 




Log. area ratio 


2 


3,8, 


4 2, 3, 8, 4 




b9,bl5,b36,b20 




Log. area ratio 


5 


7 


5, 7 




b24,b32 




LTP gain 






10,27, 44, 


61 


b44,bl00,bl56,b212 




Block amplitude 




12,29, 46, 


63 


b50,bl06,bl62,b218 




RPE pulses 






13. .25 




b56,b59, . .,b92 




RPE pulses 






30. .42 




bll2,bll5, . .,bl48 




RPE pulses 






47. .59 




bl68,bl71, . .,b204 


5 


RPE pulses 






64. .76 




b224,b227, . .,b260 




Grid position 






11,28, 45, 


62 


b46,bl02,bl58,b214 




Block amplitude 




12,29, 46, 


63 


b49,bl05,bl61,b217 




RPE pulses 






13. .25 




b55,b58, . .,b91 




RPE pulses 






30. .42 




blll,bll4, . .,bl47 




RPE pulses 






47. .59 




bl67,bl70, . .,b203 




RPE pulses 






64. .67 




b223,b226,b229,b232 




RPE pulses 






68. .76 




b235,b238, . .,b259 




Log. area ratio 


1 




1 




bl 




Log. area ratio 


2, 


3, 6 


2, 3, 6 




b8,bl4,b28 




Log. area ratio 


7 




7 




b31 




Log. area ratio 


8 




8 




b35 




Log. area ratio 


8 


3 


8,3 




b34,bl3 




Log. area ratio 


4 




4 




bl9 


6 


Log. area ratio 


4 


5 


4, 5 




bl8,b23 




Block amplitude 




12,29, 46, 


63 


b48,bl04,bl60,b216 




RPE pulses 






13. .25 




b54,b57, . .,b90 




RPE pulses 






30. .42 




bll0,bll3, . .,bl46 




RPE pulses 






47. .59 




bl66,bl69, . .,b202 




RPE pulses 






64. .76 




b222,b225, . .,b258 




Log. area ratio 


2 


6 


2, 6 




b7,b27 
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Annex 3 (informative): Format for test sequence distribution 
A3.1 Type of files provided 

Three types of files are provided: 

- Files for input of the encoder : SEQxx.INP 

- Files for input of decoder or comparison with encoder output : SEQxx.COD 

- Files for comparison with the decoder output : SEQxx.OUT 

Two diskettes are provided containing all the digital test sequences. 

The 1st flexible disk contains the SEQ01.INP, SEQ01.COD, SEQ01.OUT, SEQ02.INP, SEQ02.COD, SEQ02.OUT 
files. The 2nd flexible disk contains the SEQ03.INP, SEQ03.COD, SEQ03.OUT, SEQ04.INP, SEQ04.COD, 
SEQ04.OUT, SEQ05.COD, SEQ05.OUT files. Table A3.1 gives the contents of the two distribution flexible disks and 
also the size in bytes and the number of frames for each test sequence file. 



A3. 2 File format description 



All the files are written in binary using 16 bit words. This means that input samples (sop[k]), output samples (srop[k]) 
and coded parameters use 2 bytes each. Hence the sizes of the files are directly related to the number of processed 
frames. 

For files with .INP or .OUT extension type: 

Size (in bytes) = No of frames * 160 * 2; 
For files with .COD extension type: 

Size (in bytes) = No of frames * 76 * 2; 

Table A3.1 shows the size of all the files written in direct binary format. 

The diskette is formatted according to the high capacity (1,2 Mb) specifications for MS/DOS PC- AT compatible 
computers. 

Table A3.1 : Contents of diskettes and size of files 



FILE 


FRAMES 


SIZE(bytes) 


SEQ01.INP 




186 880 


SEQ01.COD 


584 


88 768 


SEQ01.OUT 




186 880 


SEQ02.INP 




303 040 


SEQ02.COD 


947 


143 944 


SEQ02.OUT 




303 040 


SEQ03.INP 




215 360 


SEQ03.COD 


673 


102 296 


SEQ03.OUT 




215 360 


SEQ04.INP 




166 400 


SEQ04.COD 


520 


79 040 


SEQ04.OUT 




166 400 


SEQ05.COD 


64 


9 728 


SEQ05.OUT 




20 480 
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Annex 4 (informative): Test sequence diskette 
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Annex 5 (informative): Change History 



SMG# 


Tdoc 
SMG 


Spec 


CR 


Cat 


PH 


Vers 


New 
Vers 


Subject 


32 272 06.10 A005 


F 


2 


4.1.1 4.2.0 Correction of figure 3.1 


32 




06.10 








4.2.0 


4.2.1 Update to Version 4.2.1 for Publication 



Change history 


Date 


TSG# 


TSG Doc. 


CR 


Rev 


Subject/Comment 


Old 


New 


2001-06 


12 


SP-010303 


A010 




Correction of figure 3.2 


4.2.1 


4.3.0 
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